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L16 


L15 not (L3 or L6 or L10 or L8) 


34 


L16 


L16 


L15 


L14 and CON-S 


45 


L15 


L15 


L14 


L13 and (env$) 


45 


L14 


L14 


L13 


L12 and (CON-S$) 


48 


L13 


L13 


L12 


Lll and (consensus or ancestral) 


25969 


L12 


L12 




(HIV$ or (human immunodeficiency 
virus) ) 


104532 






L10 


L9 not (L3 or L6) 


7 


L10 


L10 


L9 


hahn.in. and beatrice.in. 


15 


L9 


L9 


L8 


L7 not (L3 or L6) 


8 


L8 


L8 


L7 


korber.in. and bette.in. 


10 


L7 


L7 


L6 


L5 not L3 


6 


L6 


L6 


L5 


L4 and (consensus or ancestral) 


8 


L5 


L5 


L4 


gao.in. and feng.in. 


242 


L4 


L4 


L3 


L2 and CON-S$ 


10 


L3 


L3 


L2 


LI and (consensus or ancestral) 


35 


L2 


L2 


LI 


haynes.in. and barton. in. 


67 


LI 


LI 



END OF SEARCH HISTORY 



FILE 'WPIDS' ENTERED AT 11:53:27 ON 16 DEC 2010 
E HAYNES B F/IN 
65 S E3 

E HAYNES B/IN 
38 S E3 
90 S LI OR L2 

50 S L3 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 
5 S L4 AND (CONSENSUS OR ANCESTRAL) 
E GAO F/IN 
1996 S E3 

10 S L6 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 
7 S L7 NOT L5 

0 S L8 AND (CONSENSUS OR ANCESTRAL) 
E KORBER B T/IN 
L10 9 S E2 OR E3 

E HAHN B H/IN 
Lll 15 S E3 

E HAHN B/IN 
L12 155 S E3 

L13 166 S Lll OR L12 



L8 
L9 



L14 10 S L13 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L15 35091 S (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L16 370 S L15 AND (CONSENSUS OR ANCESTRAL) 

L17 71 S L16 AND ENV? 

L18 1 S L17 AND (CON-S) 

FILE 'MEDLINE' ENTERED AT 11:56:38 ON 16 DEC 2010 

E HAYNES B/AU 

L19 56 S E3 

E HAYNES B F/AU 

L20 260 S E3 

L21 316 S L19 OR L20 

L22 53 S L21 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L23 0 S L22 AND (CONSENSUS OR ANCESTRAL) 

L24 0 S L22 AND (CON-S) 

L25 22 S L22 AND (ENV?) 

E GAO F/AU 

L26 306 S E3 

E GAO FENG/AU 

L27 699 S E3 

L28 1005 S L26 OR L27 

L29 80 S L28 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L30 15 S L29 AND (CONSENSUS OR ANCESTRAL) 

E KORBER B T/AU 

L31 141 S E3-E6 

L32 132 S L31 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L33 65 S L32 AND ENV? 

L34 2 S L33 AND (CON-S) 

E HAHN B H/AU 

L35 208 S E3 

E HAHN BEATRICE H/AU 

L36 91 S E3 

E HAHN B/AU 

L37 88 S E3 

L38 387 S L35 OR L36 OR L37 

L39 175 S L38 AND (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L40 23 S L39 AND (CONSENSUS OR ANCESTRAL) 

L41 17 S L40 AND ENV? 

L42 4 S L41 NOT L30 

L43 222604 S (HIV? OR HUMAN IMMUNODEFICIENCY VIRUS) 

L44 9 S L43 AND (CON-S) 

L45 2017 S L43 AND (CONSENSUS OR ANCESTRAL) 

L4 6 502 S L4 5 AND ENV? 

L47 70 S L46 AND (FUSION OR CLEAVAGE) 

L48 48 S L47 AND PY<2004 

L4 9 40 S L48 AND (CONSENSUS/AB OR ANCESTRAL/AB) 



CON-S is consensus group M polypeptide obtained from the Los Alamos 2000 
sequence listing; it contains a deletion of the fusion domain (~ aa 475-505) 
and a C-terminal stop codon at ~aa 645. 

Consensus env gene is constructed by generating consensus sequences of env 
genes for each subtype of a particular group (e.g., group M comprises 
subtypes A-D, F-H, J, and K) from the Los Alamos HIV sequence database. A 
consensus sequence of all subtype consensuses can be generated to avoid 
heavily sequenced subtypes, [p. 38] 

CON-S was derived from the year 2000 database group M env sequences, [p. 40] 

Con-S gpl40 induced robust neutralization responses in multiple group M 
subtypes (e.g., B and C) . 

CON-S was obtained by aligning the consensus env sequences of group M 
subtypes (A-D, F-H, J, and K) from the 2003 Los Alamos sequence listing to 
obtain a parent sequence. The hypervariable regions (excluding V3) were 
generated through hand alignments of potential N-linked glycosylation sites 
and cysteine residues. The hypervariable loops tended to comprise a shorter 
range of lengths as compared to natural strains. The CON-S env gene was 
generated by converting amino acid sequences of CON-S to nucleotide 
sequences employing the codon usage of highly expressed human genes. HIV-1 
gpl40 Envs with deletion of the cleavage (C) site and fusion (F) domain were 
designated gp!40ACF. 



Fig,29A 
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*fliu.no acid sequence underlined is, the fusion domain that will be deleted in 140CF 
design and the «V underlined with red color is the last ain a id =it the C 
terminus , and all the remaining amino acids after the "M" will be deleted in 140CF 
design. ... 



Ftg.29B 

C0»-S«2003 140CF i pep {620 a.a.) . 
Riclc nam©; 006 

HRVMGIQRt 2HLWF KG j LI GMLI ICSAJ EBLWVTVYX WMKZP U ITLFCAS DflKAYDTE V JVSJAW 1 ?T0PNPO«"IVL 

r \ ' 'i ? v 3 3 f PC< KLTPI ? 'i ' [TTE 

DVVPIDDNKSYRLIWCKTSAITQftCPKVSFEPlPIHYCAPAGEAlLKCSDKKFNGTGPCKMVSrVQCTHGIK?WSTQIJ,T.MGSL 

■*toino acids seen in blue color is for easy identification of the junction of the 
delated fusion cleavage site, 



Fig. 29C 

CODON-OSSIMIKJSD CON-S-2003 140CF.s«ki (1891 at: 
Wick nasua =006 
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CC The invention describes an isolated protein (I) selected from 106 fully 

ZC defined 500-866 amino acid sequences given in the specification. Also 

CC described are: a nucleic acid comprising: a nucleotide sequence encoding 

CC C0N6 HIV gpl60 protein, subtype C ancestral HIV envelope protein, subtype 

CC C consensus HIV envelope protein, subtype C consensus HIV gag protein, 

CC subtype C consensus HIV nef protein, Group M consensus HIV envelope 

CC protein, subtype A consensus HIV envelope protein, Group M consensus HIV 

CC gag protein, Group M consensus HIV pol protein, Group M consensus HIV nef 

CC protein, subtype C consensus HIV pol protein, subtype B consensus HIV gag 

CC protein, or subtype B consensus HIV envelope protein, where the 

CC nucleotide sequence comprises codons optimized for expression in human 

CC cells; a nucleotide sequence encoding (I); or a nucleotide sequence 

CC selected from 89 fully defined 633-2607 bp sequences given in the 

CC specification; a vector comprising the nucleic acid of (1); a composition 

CC comprising at least one protein or nucleic acid above and a carrier; and 

CC inducing an immune response in a mammal. The protein is a consensus or 

CC ancestral immunogen useful for inducing antibodies that neutralize a wide 

CC spectrum of HIV primary isolates and/or that induces a T cell response. 

CC This is the amino acid sequence of HIV CON-S env protein. 

XX 

SQ Sequence 82 9 AA; 



Query Match 99.2%; Score 3338.5; DB 2; Length 829; 

Best Local Similarity 94.9%; 

Matches 620; Conservative 0; Mismatches 0; Indels 33; Gaps I; 



1 MRVMG I QRNCQHLWRWG I L I FGMLI ICSAAENLWVTVYYGVPVWKEANTTLFCAS DAKAY 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 MRVMG I QRNCQHLWRWG I L I FGMLI ICSAAENLWVTVYYGVPVWKEANTTLFCAS DAKAY 60 

61 DTEVHNVWATHACVPTDPNPQEIVLENVTENFNMWKNNMVEQMHEDI ISLWDQSLKPCVK 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 DTEVHNVWATHACVPTDPNPQEIVLENVTENFNMWKNNMVEQMHEDI ISLWDQSLKPCVK 12 0 

121 LTPLCVTLNCTDVNATNNTTNNEEIKNCSFNITTEIRDKKKKVYALFYKLDWPI DDNNS 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 LTPLCVTLNCTDVNAT1I1ITT1I1IEEIKNCSFNITTEIRDKKKKVYALFYKLDWPI DDNNS 180 

181 YRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 YRLINCNTSAITQACPKVS FEPIPIHYCAPAGFAILKCNDKKFNGTGPCKNVSTVQCTHG 24 0 

2 41 I KPWSTQLLLNGSLAEEEI I IRSENITNNAKTI IVQLNESVEINCTRPNNNTRKS I RI G 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 I KPWSTQLLLNGSLAEEEI I IRSENITNNAF'T 1 1 "."QLNES'.'E I NCTRPN1I1ITRKS I RIG 300 

301 PGQAFYATGDI IGDI RQAHCNI SRTKWNKTLQQVAKKLREHFHKT 1 1 FHPS SGGDLE ITT 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 PGQAFYATGDI IGDI RQAHCNI SRTKWNKTLQQVAKKLREHFHKT 1 1 FNPS SGGDLE ITT 360 

3 61 HSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQIINMWQGVGQAMYAPPIEGKIRC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 61 HSFNCGGEFFYCNTSELFNSTWNGTNNTITLPCRIKQI INMWQGVGQAMYAPPIEGKIRC 42 0 

421 TSNITGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKWKIEPLGVAPTKAK 475 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 T SN I TGLLLTRDGGNNNTETFRPGGGDMRDNWRSELYKYKWKIE PLGVAPTKAKRRWE 480 

476 TLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQ 507 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 REKRAVGIGAVFLGFLGAAGSTMGAASITLT . . . . . NLLP_-_IE_-_QQHLLQ 540 

508 LTVWG I KQLQARVLAVERYLKDQQLLGI WGCSGKLICTTNVPWNS SWSNKS QDE I WDNMT 567 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 LTVWG I KQLQARVLAVERYLKDQQLLGI WGCSGKLICTTNVPWNS SWSNKS QDE I WDNMT 600 



Qv 568 v\l IEi\ DTE I1INYTDI I YSLIEESQNQQEKNEQELLALDKWASLWNWFDITNWLW 620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 WMEWDKEI1I1IYTDI I YSLIEESQNQQEKNEQELLALDKWASLWNWFDITNWLW 653 



